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(£) A computer system for executing branch instructions 



@ A computer system for executing branch in- 
structions and a method of executing branch 
instructions are described. Two instruction 
fetchers respectively fetch a sequence of in- 
structions from memory for execution and a 
sequence of instructions commencing from a 
target location identified by a set branch in- 
struction in a sequence of instructions being 
executed. When an effect branch signal is gen- 
erated, the target instructions are next exec- 
uted, and the fetcher which was fetching the 
instructions for execution commences fetching 
of the target instructions. 

The effect branch signal is generated sepa- 
rately from the set branch instruction. 

In another aspect, the effect branch signal is 
generated on execution of a conditional effect 
branch instruction, located at the point in the 
instruction sequence where the branch is to be 
taken. 
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This invention relates to a computer system for executing branch instructions and to a method for operating 
such a comput r system. 

Programs consist of blocks or strings of sequential instructions, which have a single entry point (the first 
instruction) and a single exit point (the last instruction). There are one or two choices of instruction blocks to 
5 be executed after any particular block. When there are two possible blocks, a condition must be used to de- 
termine which block to choose. The pattern of links between blocks is called the program's control or flow 
graph. 

These blocks of instructions are packed together in memory. When there is no choice of subsequent block 
(bioclcB)rit^rrnormaH^^ 

10 be any explicit change in control to get from block A to block B. Sometimes this is not possible, for instance, 
if more than one block has block B as a successor. All but one of these predecessors must indicate that the 
subsequent block will not be the next sequential block, but block B. These are unconditional branches. Some 
blocks have a choice of successor blocks. Clearly only one of the successors, for example block B, can be 
placed sequentially afterwards. The other block, block C, is indicated explicitly within block A. A conditional 

15 mechanism is used to determine which block is to be chosen. If the condition is met, then the chosen successor 
block is block C. If the condition is not met, then the chosen successor is block B. These are conditional branch- 
es. 

Branches are well known in the art and are essential for a computer system to execute any program. Known 
computer systems contain a special register, the instruction pointer register, which provides an indication of 

20 the address of the next instruction to execute. This register is usually automatically incremented after an in- 
struction executes, so that it now indicates the address of the next sequential instruction. Branch instructions 
are used to change this behaviour. These branch instructions specify an alternative address (the target loca- 
tion) for the next executable instruction. Conditional branch instructions also specify a condition which must 
be met for the alternative address to be used - otherwise the instruction pointer will be incremented as usual. 

25 These branch instructions thus define the end of a block of instructions. 

In a non-pipelined computer system, the computer fetches, decodes and executes to completion one in- 
struction, before moving on to the next instruction. However, in a pipelined system where fetch, decode and 
execution stages can all operate simultaneously on a stream of instructions, it is possible to fetch instructions 
which are not required. For instance, consider a system with a four stage instruction pipeline with fetch, decode, 

30 execute and write stages. The earliest that a branch instruction can be detected is in the decode stage, by 
which time the next sequential instruction in memory will have already been fetched. For an unconditional 
branch this must be thrown away, and new instructions fetched from the target location. For conditional branch- 
es it is more complicated. The condition must be evaluated to determine whether or not to change to the target 
location. This will occur in the execute stage, thus the sequentially fetched instruction must be stalled in the 

35 fetch stage, and only after the branch has been executed can the pipeline proceed. If the condition was tru , 
then the sequentially fetched instruction must be ignored, and new instructions fetched from the target location. 
The first pipelining applied to any processor architecture is to issue instructions in advance, as this is one of 
the easiest speed-ups. 

From the previous description, it is clear that the instruction after a branch instruction is always fetched, 
40 but is only sometimes required, and that therefore a pipeline bubble is created while determining what to do. 
An attempt has been made to improve this by changing the semantics of branch instructions, so that the sub- 
sequent instruction is always executed and the branch determines whether the instruction executed after that 
one is the one sequentially after it, or the instruction at the target location. These are called delayed branches, 
and the instruction immediately following the branch instruction is called a branch delay slot Figure 1 illustrates 
45 schematically this operation. The branch instruction is detected in the decode stage. The branch delay slot is 
Inst 1 , which is always executed. If the branch is taken, then the next executed instruction will be Inst DO being 
the first instruction of a different block, whereas if the branch is not taken, it will be Inst 2 which is the first 
instruction of the next sequential block. Inst 1 must be an instruction which can always be executed, regardless 
of the outcome of the (conditional) branch, and it must not be an instruction which determines whether the 
so conditional branch is to be taken. If no instruction can be found within the program which satisfies these con- 
ditions, then an instruction which has no effect (NO OP) must be inserted instead. * - -« - 

Pipelines can be designed where the optimum number of delay slots is more than one. The more deeply 
pipelined a computer is, the more delay slots are generally required. Unfortunately, it gets harder and harder 
to find useful instructions to put in each additional slot, so many of them are filled with instructions which do 
55 nothing. This places large bubbles of NO OP instructions in the execution pipeline, thus reducing the speed 
advantage obtained by making a deep pipeline. 

Another significant problem with this approach is that when a new computer system of an existing instruc- 
tion set is designed, with a new pipeline organization, and therefore a different number of branch delay slots, 
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it cannot execute existing binaries. Programs must be recompiled in order to be executed. 

In an attempt to dispense with branch delay slots, one known system uses two instruction fetchers at the 
fetch stage of a pipeline, each instruction fetcher being capable of fetching and holding a s quence of instruc- 
tions. One instruction fetcher has associated with it local decode circuitry which is arranged to detect branch 

5 instructions. It will be appreciated that this local decode circuitry is in addition to the norma! decode stage of 
the pipeline. When a branch instruction is detected by the active fetcher it initialises the other instruction fetcher 
to start fetching instructions from the new block while the instructions up to the branch instruction of the first 
block continue to be passed into the pipeline for decoding and execution. Not only does this system require 

extraiocal-decodercircuitry to detect branctiinst ructions prior to the normal decode stage ofthe pipeline, but 

10 it also involves speculative fetching of instructions from the memory, many of which may not be required. 

EP-A-355069 (Evans & Sutherland Computer Corporation) defines a system in which there is separation 
of the instruction to effect a branch into two different parts. The set branch instruction indicates the target 
location for the branch and can be placed as near the beginning of the string of instructions as possible. 
Actual implementation ofthe branch is carried out later in response to a split bit located in a later instruction. 

15 The provision of the target location for the branch with the set branch instruction provides an early indi- 

cation of the fact that a memory access is going to be made (or is likely to be made) and provides the memory 
address (the target location) for that access. When the split bit causes the branch to be taken, and the time 
comes therefore to access that memory address, the system has had a chance to set up for the access, for 
example by bringing the necessary data into a local cache. 

20 One problem associated with the system of EP-A-355069 is that the target location from which new in- 

structions are fetched is reset after a split bit signal has been executed. This means that there cannot be mul- 
tiple branches using the target location set up by a single set branch instruction, it is advantageous to allow 
for this situation and it is one object of the present invention to provide an improved system for implementing 
branches allowing for this. 

25 According to one aspect of the present invention there is provided a computer system for fetching, decoding 

and executing instructions comprising: 

storage circuitry for holding a plurality of instructions at respective storage locations, said plurality of 
instructions being arranged in instruction strings, each string comprising a first instruction and a set of sub- 
sequent instructions; 

30 instruction fetch circuitry forfetching a sequence of instructions from said storage circuitry and including 

an indicator for providing an indication of a next address at which a next fetch operation is to be effected; 

execution circuitry for executing fetched instructions, wherein at least some of said instruction strings 
each includes a set branch instruction (SET) which provides an indication of a target location from which a 
subsequent instruction may be fetched, the subsequent instruction being from a different instruction string, 

35 and wherein said instruction fetch circuitry is operated responsive to execution of a said set branch instruction 
(SET) to fetch in parallel subsequent instructions from said string containing said set branch instruction and 
new instructions from said different instruction string commencing from said target location while said sub- 
sequent instructions continue to be executed; 

a target store for holding the indication of said target location, said indication being loaded into said store 

40 on execution of said set branch instruction (SET) and being held in said store as a valid indication until exe- 
cution of a subsequent set branch instruction; and 

select circuitry responsive to generation of an effect branch (DO) signal indicative that further instruc- 
tions to be executed are said new instructions, to cause said execution circuitry to execute said new instruc- 
tions and to cause said instruction fetch circuitry to fetch again new instructions commencing from said target 

45 location. 

The invention also provides a method of operating a computer to fetch decode and execute instructions 
which computer has storage circuitry holding a plurality of instructions at respective storage locations, said 
plurality of instructions being arranged in instruction strings, each string comprising a first instruction and a 
set of subsequent instructions the method comprising: 
so fetching a sequence of instructions from said storage circuitry and providing an indication of a next ad- 

dress at which a next fetch operation is to be effected; 
decoding said instrutions; 

executing each instruction in turn, wherein at least some of said instruction strings each include a set 
branch instruction (SET) which provides an indication of a target location from which a subsequent instruction 
55 may be fetched, the subsequent instruction being from a different instruction string; 

on execution of said set branch instruction, holding the indication of said target location in a target store 
as a valid indication until execution of a subsequent set branch instruction, fetching in parallel subsequent in- 
structions from the string containing said branch instruction and new instructions from said different instruc- 
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tion string commencing from said target location; 

continuing to execute said subsequent instructions until an effect branch signal is generat d which in- 
dicates that further instructions to be executed are said new instructions; and 

responding to said effect branch signal by commencing execution of said new instructions and fetching 
5 again new instructions commencing from said target location. 

In one embodiment said instruction fetch circuitry comprises two instruction buffers, a first buffer for hold- 
ing subsequent instructions connected to said execution circuitry, and a second buffer for holding new instruc- 
tions wherein the contents of said second buffer are copied into said first buffer responsive to generation of 

said-effectrbranch-fDO)sTgnal; 

10 In the described embodiment said instruction fetch circuitry includes two instruction fetchers for fetching 

respectively said subsequent instructions and said new instructions and wherein said select circuitry is oper- 
able to connect a selected one of said instruction fetchers to said execution circuitry. 

In the simplest case, the target store can hold the memory address of the target location. To allow kernel 
entry, the set branch instruction can identify the target location using an implicit value which addresses a spe- 
15 cial register holding the memory address of the new instructions. 

To allow descriptor branches to be executed, the target store can hold a pointer to a memory location which 
contains the memory address of the target location. 

The effect branch signal is generated when the branch point, at which the branch is to be taken, is iden- 
tified. This can be done in a number of ways. For example, a further instruction can be located in the string 
20 of instructions being executed prior to the branch point in which case said further instruction will identify the 
branch point which will be held in a branch point register. The contents of the branch point register can then 
be compared with an instruction pointer register holding an indication of the address from which a next instruc- 
tion would normally be fetched and when the two are equal the effect branch signal is generated. Alternative 
methods for identifying the branch point are also discussed herein. 
25 The provision of a further instruction which identifies the branch point but which is located before the 

branch point reduces the number of unwanted instructions which will be fetched before the branch is taken. 

As a still further alternative, the set branch instruction itself can identify the branch point which is stored 
in the branch point register, thereby obviating the need for a further instruction. 

However, in a particularly preferred embodiment, the branch point is identified by a further, dedicated in- 
30 struction, different to the set branch instruction, which is located at the branch point in the string of instructions 
being executed. To allow for additional branches to be effected, this effect branch instruction can itself define 
the condition to be satisfied so that a branch is only taken if the condition is satisfied and is not taken if the 
condition not satisfied. 

This provides a further technical advantage over the system of EP-A-355069 discussed above. In that sys- 
35 tern, the set branch instruction itself must indicate whether or not the branch is conditional or not and cause 
various different condition detectors to be in a ready state, ready to sense a condition. The condition itself is 
defined in an instruction different to the set branch instruction and to the split bit instruction. 

To avoid the need for state indicators, the present invention provides in another aspect a computer system 
for fetching, decoding and executing instructions comprising: 
40 storage circuitry for holding a plurality of instructions at respective storage locations, said plurality of 

instructions being arranged in instruction strings, each string comprising a first instruction and a set of sub- 
sequent instructions; 

instruction fetch circuitry for fetching a sequence of instructions from said storage circuitry and including 
an indicator for providing an indication of a next address at which a next fetch operation is to be effected; 

45 execution circuitry for executing fetched instructions, wherein at least one of said instruction strings 

includes a set branch instruction (SET) which provides an indication of a target location from which a subse- 
quent instruction may be fetched, the subsequent instruction being from a different instruction string, and an 
effect branch instruction different from said set branch instruction and located at the branch point after which 
said new instructions are to be executed and wherein said instruction fetch circuitry is operated responsive 

50 to execution of a said set branch instruction (SET) to fetch in parallel subsequent instructions from said string 
containing said set branch instruction and new instructions from said different instruction string commencing 
from said target location while said subsequent instructions continue to be executed; and 

select circuitry responsive to execution of a said effect branch (DO) instruction to cause said execution 
circuitry to execute said new instructions if a condition determined by the effect branch instruction is satisfied. 

55 The invention also provides in a further aspect a method of operating a computer to fetch decode and exe- 

cute instructions which computer has storage circuitry holding a plurality of instructions at respective storage 
locations, said plurality of instructions being arranged in instruction strings, each string comprising a first in- 
struction and a set of subsequent instructions the method comprising: 
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fetching a sequence of instructions from said storage circuitry and providing an indication of a next ad- 
dress at which a next fetch operation is to be effected; 
decoding said instructions; 

executing each instruction in turn, wherein at least one of said instruction strings includes a set branch 
5 instruction (SET) which provides an indication of a target location from which a subsequent instruction may 

be fetched, the subsequent instruction being from a different instruction string; 

on execution of said set branch instruction, fetching in parallel subsequent instructions from the string 

containing said branch instruction and new instructions from said different instruction string commencing from 

s^srttTtargerlucatton; 

10 continuing to execute said subsequent instructions until an effect branch instruction is executed which 

is located at the branch point after which new instructions are to be executed and which indicates that further 

instructions to be executed are said new instructions If a condition determined by the effect branch instruction 

is satisfied; and 

responding to said effect branch signal by commencing execution of said new instructions. 

15 As an alternative arrangement to enable the computer system to perform conditional branches, it can in- 

clude circuitry for holding a state indicator in one of a confirmed state and a rejected state. The confirmed state 
is one in which further instructions to be executed are new instructions commencing from the target location. 
The rejected state is one in which further instructions to be executed are subsequent instructions in memory 
and not new instructions. One of the states can be set responsive to execution of the set branch instruction 

20 and the other of the states can be selectively set responsive to execution of a second instruction different from 
the set branch Instruction and subject to a condition. 

The second instruction can be a confirm instruction which sets the confirmed state if the confirm condition 
is satisfied. 

Alternatively, the second instruction could be a reject instruction which sets the rejected state if the reject 
25 condition is satisfied. 

The provision of these reject or conf irm instructions allows a further improvement to be made in that the 
set branch instruction is the first instruction of the string and there is a plurality of contiguous instruction 
strings, with the set branch instruction acting as a further instruction to generate the effect branch signal if 
the state indicator is in the confirmed state. It will be appreciated that the set branch instruction acting as the 
30 further instruction will also change the state of the state indicator back to its original state. Preferably the con- 
firm/reject instruction can be placed as early as possible within the string (after the condition has been gen- 
erated) so that the execution circuitry can be given an early indication of which way the branch will go. 

For a better understanding of the present invention and to show how the same may be carried into effect, 
reference will now be made by way of example to the accompanying drawings in which:- 
35 Figure 1 is a schematic illustrating a known branching system; 

Figure 2 is a schematic illustrating the branch system of the present invention for non-conditional branches; 

Figure 3 is a schematic illustrating the branch system of the present invention for conditional branches; 

Figure 4 is a simple block diagram of a pipelined processor; 

Figure 5 is a circuit diagram of an instruction fetcher; 
40 Figure 6 is a circuit diagram of a computer system for implementing branch instructions; 

Figure 7 is a circuit diagram of an instruction fetcher with kernel and descriptor functions; 

Figure 8 is a schematic diagram illustrating procedure catling; 

Figure 9 is a sketch illustrating states for performing procedure calls; 

Figure 10 is a block diagram illustrating an alternative implementation for an instruction fetch circuit; 
45 Figure 11 is a block diagram illustrating a non predictive fetcher and 

Figure 12 is a block diagram illustrating a predictive fetcher. 

Reference will first be made to Figures 2 and 3 to explain the concept underlying the branching system 
of the present invention. Figure 2 illustrates three blocks of instructions in memory, Block A. Block B and Block 
C. Each block comprises a first instruction which in each case is a set branch instruction Set B, Set C, Set D, 

so respectively, a sequence of subsequent instructions for example Inst A1, lnstA2 ... lnstAi-1 in Block A and a 
last instruction which in each case is an effect branch instruction referred to herein as DO. Assume that the 
sequence of instructions in Block A is being fetched decoded and executed in a pipelined computer system. 
On execution of the first instruction Set B, a target location for a branch is stored, in this case identifying the 
memory address of the first instruction Set C of Block B. However, no action is taken at this stage other than 

55 to store the target location and possibly to set up the memory containing Block B for an access, for example 
by moving the relevant memory addresses to a local cache. The instructions in Block A continue to be fetched, 
decoded and executed until the last instruction, DO, is being executed. Execution of this instruction causes 
an effect branch signal to be generated which causes the execution unit to address as its next instruction the 
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target location set up by the set instruction Set B. Thus, th next instruction to be fetched from memory is the 
first instruction Set C of Block B. This is indicat d by the dotted arrow in Figure 2. 

Figure 2 illustrates the case for unconditional branches, that is branches that will inevitably be taken. Figure 
3 illustrates the position for conditional branches, that is branches that may or may not be taken depending 

5 on whether or not a condition which has been evaluated is satisfied. Figure 3 illustrates the case where a third 
instruction is used in addition to the set branch instruction and effect branch instruction described above with 
reference to Figure 2. In Figure 3 this third instruction is referred to as CONFIRM, although it will become clearer 
in the following that it is possible to implement conditional branches using a reject instruction with the opposite 

semantics; 

10 It is also possible to implement conditional branches using a conditional DO instruction. 

Figure 3 illustrates three sequences of instructions held in memory as Block A, Block B and Block C. Block 
B is shown contiguous to Block A and is arranged in memory such that if instructions are fetched from memory 
using sequential memory addresses then instructions will be normally fetched in the sequence of Block A fol- 
lowed by Block B. Block C is shown located elsewhere in memory. As in Figure 2, each block comprises a first 

15 instruction which is a set branch instruction (Set C in Block A Set B in Block B and Set E in Block C). Block 
A then additionally comprises a sequence of instructions to be executed including a conf irm instruction and 
the last instruction which is the effect branch instruction. As described above with reference to Figure 2, in- 
structions are fetched, decoded and executed. When the first instruction of Block A is executed it is identifed 
as a set branch instruction with a target location identifying the memory address of the first instruction Set E 

20 in Block C. Instructions in Block A continue to be fetched, decoded and executed until the confirm instruction 
is reached which has a condition associated with it. If the condition is satisfied, the branch is confirmed and 
execution of the effect branch instruction DO at the end of Block A will cause the branch identified by the target 
location to be taken as indicated by the dotted line in Figure 3. Thus, the next instruction to be fetched, decoded 
and executed will be the first instruction Set E of Block C. If the confirm condition is not satisfied, the branch 

25 will not be taken when the effect branch instruction is executed but instead the next instruction to be fetched, 
decoded and executed will be the first instruction Set D of Block B which sequentially follows Block A in mem- 
ory. It will readily be appreciated that once confirm instructions has been introduced, it will be necessary even 
to confirm unconditional branches such as that which is illustrated by way of example in Block C, where the 
branch is always confirmed and is not subject to a condition. 

30 It is assumed for the purposes of the present description that any useful computer system must be capable 

of implementing conditional branches in addition to unconditional branches. It will be appreciated that in ord r 
to implement branches as described above with reference to Figures 2 and 3, a target register must be provided 
for storing the target location indicated by the set branch instruction. Moreover, for conditional branches a state 
indicator must be provided to indicate whether the branch is in a conf irmed state or not. A detailed explanation 

35 of circuitry capable of implementing the present invention is given later. Firstly, there follows an explanation 
of the various different ways in which branch instructions in accordance with the present invention may be 
implemented. 

METHOD A 

40 

set <addr> ; specify branch target 

/•confirm the branch 
;execute it, if confirmed 

50 This is the scheme outlined above. There are three distinct instructions. One bit of state is needed to spec- 

ify whether the branch has been confirmed or not There could be multiple confirm instructions; the branch - 
being taken if the condition evaluated for any confirm instruction is true. 



55 



confirm <cond> 

45 



do 
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METHOD B 



set 



<addr > 



;specify branch target 



reject 



ccond> 



;optional conditional reject 
branch 



10 



do 



;do the branch if not rejected 



This mechanism is similar to Method A, except that the branch has to be rejected, rather than confirmed. 
15 As before, one bit of state is required to hold whether the branch has been rejected or not The set instruction 
initializes the branch target register, and sets the state to branch. The reject instruction conditionally sets the 
state to not branch. The do instruction executes the initialized branch, if the state is still at branch. With these 
semantics, the reject instruction is optional, unconditional branches would not need one. There can be more 
than one reject instruction, if any nullify the branch, then the branch will not be taken. 



20 



METHOD C 



25 



set 
• « • 
do 



<addr > 
<cond> 



;initialize branch target 
conditionally execute branch 



This method combines the confirm and do instructions, so that do becomes a conditional do. There is no 
30 requirement for a state machine to hotd whether the branch is to be taken or not. 

METHOD D 



35 



set 



<addr > 



; initialize branch target 



confirm <cond> 



conditionally confirm branch 



40 



do 



<of f set > 



/•perform branch some 
instruction later 



45 



50 



This scheme delays the effect of the do instruction by a variable amount (<offset>). This permits unnec- 
essary instruction fetching to be stopped at the end of the block of instructions. 

The end of the block (branch point) is indicated in the do instruction, either as an offset to the last instruction 
in the block, the first instruction in the subsequent block or as an instruction count to the end of the block. Two 
registers are required, one bit of state to record whether the branch has been confirmed or not, and a branch 
point register to hold the block termination address or count, as an indication of the branch point. 



55 
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METHOD E 



10 



set 

reject 



do 



<addr> initialize branch target 

<cond> ;optionally conditionally 

reject branch 



<offset> ;perform branch some 

instruction later 



15 



20 



25 



This scheme is similar to Method D, except that the branch is conditionally rejected, rather than confirmed. 
The reject instruction is optional. 



METHOD F 



set 



<of f set> , <addr> initialize target and branch 

point 



confirm <cond> 



;conditionally confirm branch 



This scheme is similar to Method D, except that the branch point is specified with the set instruction, thus 
30 removing the requirement for a do instruction. The branch still needs to be confirmed, otherwise it will be ig- 
nored. Two additional registers are required, one bit of state to record whether the branch has been confirmed 
or not, and a branch point register to hold the block termination offset or count as an indication of the branch 
point. 

35 METHOD G 

sefc <of f set> , <addr> initialise target and 

branch point 

confirm <cond> ;optionally conditionally 

reject branch 

This scheme is similar to Method F, except that the branch can optionally be rejected. Unconditional 
branches do not require a reject instruction. The same additional state is required. 



40 



45 



50 



METHOD H 



set <addr> initialize branch target 

55 do <cond> , <of f set> ; confirm and set branch point 

This method is a combination of Method C and Method D. The set instruction specifies the branch target 
location, whilst the do instruction specifies both a branch point and the condition on whether the branch is to 
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be executed. Two pieces of state are required to support this, a branch point register to hold the branch point 
and one to hold whether the branch is to be executed or not Both of these are set by the do instruction. If the 
condition is false, the processor stays in the unbranching state. Unconditional branches could either specify 
a true condition, or use a special unconditional do instruction. 
5 This method is interesting, as it does not require the branch target location to be set for each block, if the 

target of successive blocks is the same. This occurs in tight loops. For instance the following code is valid. 



loop ;set branch target to loop 

;loop entry point 
;loop code 

<cond> , <of f set> ;loop continuation condition 

;some more loop code 

The set instruction is executed only once. The same branch target is required for each iteration of the loop. 
Some of the above referenced methods can be modified still further. For example, it will be clear that for 

20 a plurality of contiguous blocks in memory as illustrated for example In Figure 3 where the set branch instruc- 
tion is the first instruction of a block and the do instruction is the last Instruction of a block, there will be at 
each block interface a do instruction Immediately followed by a set instruction. This is illustrated particularly 
where Block A meets Block B in Figure 3. For methods which hold an indication as to whether or not the system 
is in a branch state, it is possible to eliminate the do instruction at the end of a block and to rely on the set 

25 instruction at the beginning of a contiguous block. Thus, Methods A and B can be modified to eliminate a dif- 
ferent do instruction and to interpret the set instruction as follows. If the system is in a branch state, the sub- 
sequent set instruction at the beginning of the next block is executed as though it were a do instruction to effect 
the branch. If the system is not in a branch state when the set instruction at the beginning of the next block is 
executed, the branch will not be taken and the set instruction will be executed in the normal manner to set up 

30 a branch with a target location. 

There will now be described a computer system for implementing branching using a split branch instruction. 
Figure 4 is a simplified block diagram of a pipelined computer system. This shows a memory 41 which in this 
example comprises a conventional RAM. The computer system includes an address bus 39 and a data bus 43 
coupled to the memory 41 . It will be appreciated that read and write control signals for the memory are required 

35 but they are not illustrated in Figure 4. The memory 41 holds program comprising sequences of instructions 
at different addressable locations, as already described above with reference to Figures 2 and 3. The memory 
41 may also hold data. The data bus 43 carries data values to and from the memory 41 . The address bus 39 
carries memory address values for read or write operations. The computer system comprises an instruction 
fetch circuit 1 0 which is arranged to supply addresses to memory along address line 12 and to receive instruc- 

40 tionsfrom memory along data line 14. A decode circuit 16 is connected to receive instructions fetched by the 
fetch circuit and supplied on instruction line 22 and decodes them. The decode circuit 16 supplies instructions 
to an execution circuit 18 which executes instructions and controls a result write circuit 20 to write results of 
the execution into temporary registers 11. For the purposes of the present invention, the decode circuit 16, 
execution circuit 1 8, result write circuit 20 and registers 11 are conventional and are not described further here- 

45 in. They are referred to together in the following as the processor 17. 

The fetch circuit 1 0 is arranged to fetch four bytes at a time from the memory 41 and to provide instructions 
along instruction line 22 to the decode circuit 16. Where the instructions are of variable length, it will be ap- 
preciated that an alignment mechanism is required within the instruction fetch circuit to deal with instructions 
which are not exactly four bytes long and to correctly align these instructions. Circuitry to accomplish this is 

so described in our copending Application No. (Page White & Fairer Ref. 74893 Compressed Instruction Set). 
The present invention can be implemented with same length or variable length instruction sets. 

The fetch circuit 10 comprises two instruction fetchers, one of which will now be described with reference 
to Figure 5. The data line 14a and address line 12a are connected to the corresponding lines 14 and 12 illu- 
strated in Figure 4. The fetcher includes a fetch pointer 65 which comprises a 32 bit latch containing the next 

55 address in memory from which a 32 bit word is to be read. Whenever a value is read from memory, the fetch 
pointer 65 is normally increased by four bytes to a new pointer address via an increment unit 38. The fetcher 
is arranged to read instructions from memory on data line 14 into an instruction buffer 66 from which they are 
output along instruction line 22 to the processor 17. For variable length instructions, a length indicator 74 is 
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provided which receives on line 70 an indication from the instruction buffer 66 of the length of the instruction 
which has been output The length indicator 74 generates an amount signal 86 which indicates the number of 
bytes used by the instruction being output A byte counter 75 holds a count of the number of bytes in the in- 
struction buffer 66 at any one time and generates a select signal 24 which determines where in the instruction 
buffer 66 bytes fetched from the memory 41 are to be inserted. The instruction buffer 66 also receives a remove 
signal 26 which causes an instruction to be removed from the buffer. Unremoved instructions are shifted along, 
thereby creating space at the end of the buffer for more bytes from the memory 41 . The instruction buffer 66 
is also responsive to a store signal 28 which causes bytes from the memory to be stored in the buffer at the 
atiorrirrdicated-by-the-selects1gna|-24TThe s t o r e s tgnah28isnderiv ed f r om a latcfrsignal^3Q^whrchis~ased" 
to indicate that data can be stored from the memory. 

The fetch pointer 65 is supplied with an address on line 32 from a fetch multiplexor 34 which has one input 
connected to receive a next address on line 36 from the increment unit 38. The increment unit 38 updates the 
fetch address in the usual manner, in this example by four bytes each time. The other input of the fetch multi- 
plexor 34 is connected to a start line 40 on which is supplied a new address from which fetching is to be com- 
menced. It will readily be appreciated that for the purposes of branching the new address is derived from the 
target location identified in the set branch instruction. A begin signal 42 controls the fetch multiplexor 34 to 
start fetching from a new address on the start line 40. The fetch pointer 65 has its latch input connected to an 
OR gate 44 which receives the begin signal 42 and the latch signal 30 to control latching of subsequent ad- 
dresses into the fetch pointer 65. The latch signal 30 is also supplied to a gate 46 which is connected to the 
latch input of the byte counter 75 and which receives as its other input a next signal 48 which is used to indicate 
that an instruction has been read from the instruction buffer 66 and which thus latches the appropriate byte 
count. 

The begin signal 42 is also fed to the clear input of the byte counter 75 and to the clear input of a stop 
latch 50. The stop latch 50 receives on the Stop At line 52 the address after which no further instructions are 
to be fetched. This address is stored in the stop latch 50 under the control of a stop signal 54. The output of 
the stop latch 50 is fed to a comparator 56 which also receives the memory address on line 12. The comparator 
56 determines whether or not the memory address on line 12 has reached the address stored at the stop latch 
50. The output of the comparator 56 is supplied to a gate 58 which also receives the output from a check full 
unit 80. The check full unit 80 receives the output from the byte counter 75 and asserts its output whenever 
the buffer 66 is full. The output of the gate 58 provides a full signal on line 82 which indicates either that the 
instruction buffer 66 is full or that the stop address has been reached. No further memory requests will be 
performed when the full signal 82 is asserted. The output of the byte counter 75 is also fed to a second incre- 
ment unit 84 the output of which is fed to one input of a count multiplexor 85. The other input of the count mul- 
tiplexor 85 is supplied from a subtract unit 90 which receives the output from the byte counter 75 and the output 
from the length indicator 74. The subtract unit 90 asserts a more signal 92 whenever the instruction buffer 
does not contain at least one instruction. 

Operation of the instruction fetcher will now be described. A start address, which is the memory address 
identifying the first four bytes at the beginning of a block of instructions is supplied on line 40 to the fetch mul- 
tiplexor 34. The begin signal 42 is asserted to clear the stop latch 50 and to latch the start address into the 
fetch pointer 65. A memory access is made at that address and the first four bytes are supplied along data line 
1 4 to the instruction buffer 66. The latch signal 30 causes these four bytes to be stored in the buffer and latches 
a next address into the fetch pointer 65. This next address will now be the start address incremented by four 
bytes by the increment unit 38, unless a branch is being taken as described more fully later. The instruction 
buffer 66 supplies an instruction along line 22 and indicates the length of the instruction on line 70 to the length 
indicator 74. The output of the length indicator 74 supplies the amount signal 86 to indicate the size of the in- 
struction which has been removed and also supplies an input to the subtract unit 90 which provides a count 
to the byte counter 75 to enable it to assert the select signal 24 to determine where in the buffer the next four 
bytes fetched from memory should be stored. 

As has already been described, removal of an instruction from the buffer on line 22 is under the control 
of the next signal 48. If there is not at least one instruction held in the instruction buffer, the more signal 92 is 
asserted. If the instruction buffer is full, the check full unit 80 causes the full signal 82 to be asserted to prevent 
further memory accesses. 

The stop latch 50 is used to store an address which the fetcher should not advance past On commence- 
ment of fetching at the start address th stop latch 50 is cleared to an invalid address by the begin signal 42. 
Storing a special address in the stop latch 50 will stop fetching. The use of the stop latch 50 will become clearer 
in the following description of the application of the fetcher unit 

When a branch instruction has been fetched, decoded and executed, the target location identified by the 
branch instruction is used to provide the start address on line 40 for the instruction fetcher when the effect 
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branch signal has been supplied to cause a branch to be taken. The fetcherthus commences fetching of in- 
structions from the new target location. 

The fetch circuit 1 0 of Figure 4 includes two instruction fetchers of the type described above with reference 
to Figure 5. Reference will now be made to Figure 6 to describe in more detail how branch instructions control 
5 the activity of the instruction fetchers. In Figure 6, the fetchers are illustrated as Fetcher A and Fetcher B. The 
system includes an arbitrator unit 1 00 which arbitrates between Fetcher Aand Fetcher B for access to the mem- 
ory using the full and more signals from each fetcher to determine which requires data. The full and more sig- 
nals for Fetcher A are referred to at the arbitrator as more A and full A and the full and more signals for Fetcher 

B are referred to at the arbitrator as more B and f ull^rReferertce~nameralS" relating to FetcherA~conrespond 

w to those for the instruction fetcher shown in Figure 5. These numerals are not repeated again for Fetcher B, 
because this has precisely the same signals and operation as fetcher A. The address outputs 12a of the Fetch- 
ers Aand B are supplied through an address multiplexor 102 to the address line 1 2. The multiplexor is controlled 
by a Sel A signal on line 1 04 from the arbitrator 100 which gives priority to the fetcher which is currently fetching 
instructions for execution. It will be appreciated that one of the fetchers is used to fetch instructions in the cur- 
15 rent block (and potentially the next sequential block) and the other fetcher is used to fetch instructions com- 
mencing from the target location. Both fetchers receive instructions along data line 14 and supply their instruc- 
tions on line 22a to a select multiplexor 1 04. The output 22 of the multiplexor 104 is connected to the processor 
17 which decodes and executes instructions and writes the results to temporary registers 11. The processor 
17 supplies the next signal which is fed to the instruction fetchers on line 48. The processor 17 receives a wait 
20 signal on line 106 which is derived from the more signals 92 of the fetchers. As the processor 17 executes 
instructions it updates an instruction pointer register 1 08 to point to the next instruction to be executed by sup- 
plying an output on the new IP line 110 via a pointer multiplexor 112. A latch IP signal on line 114 Indicates 
when the instruction pointer register 108 is to be updated. The output of the instruction pointer register 108 
provides the current value of the instruction pointer on a current IP line 116. 
25 The computer system of Figure 6 also includes a target pointer register 118 for holding a target location 

identified by a set branch instruction and a branch pointer register 120 for holding a value identifying the point 
at which the branch is to be taken. The computer system also includes an active fetcher switch 122 which con- 
trols which of the fetchers A and B is supplying instructions to the processor 17. There is also a compare unit 
124 connected to receive the outputs from the branch pointer register 120 and the instruction pointer register 
30 108. A check output from the processor 17 on line 126 is fed via a gate 128 to control the pointer multiplexor 
112 and the active fetcher switch 122. The gate 128 also has an input on line 130 from the compare unit 124. 

The execution circuit of the processor 17 includes a set branch instruction execution unit 136 and a do 
instruction execution unit 142. The computer system also includes a state indicator 132 for holding an indication 
of the state of the system, that is whether it is in a branch state or not 
35 The remaining gates illustrated in Figure 6 are not described herein because they are illustrated for the 

sake of completeness only to more clearly demonstrate the connections between the signal lines. 

Operation of the computer system of Figure 6 will now be described. Assume that the processor 17 is to 
execute block A of Figure 3. The start address identifying the beginning of the first instruction of block A is 
supplied to fetcher A. The f irst instruction is supplied on line 22a to the multiplexor 104 which is controlled by 
40 a signal on line 134 from the active fetcher switch 122 so that instructions from fetcher A are supplied to proc- 
essor 1 7. The first instruction is a set branch instruction Set C which is identified by the set branch execution 
unit 136 forming part of the processor 17. The set branch execution unit 136 generates the appropriate target 
location on target line 138 and a set signal on line 140. The set signal is used to activate the begin signal 42 
for fetcher B so that that fetcher is initialised to commence starting to fetch instructions from the target location. 
45 The target location is also stored in the target pointer register 11 8. Instructions in block Acontinue to be supplied 
by fetcher A from memory to the processor 1 7 and executed. On execution of the conf irm instruction, the con- 
dition for the confirm instruction is evaluated and the state indicator 132 is appropriately set The condition 
line 145 indicates the state of the system which is selectively supplied to the stop lines 54 and the fetchers as 
described later. The following description assumes that the condition for the confirm state has been positive 
so so that the system is in a branch state. After fetching decoding and execution of further instructions up to and 
including lnst A-1 in block A, the next instruction is an effect branch instruction. This is executed by the do 
instruction execution unit 142 which forms part of the processor 17. On execution of the effect branch instruc- 
tion by this unit 142, the branch point is calculated (the first address of the first instruction at the beginning of 
the next sequential block) and is supplied to the branch pointer register 120 along line 144. The branch point 
55 is also supplied to the stop latches 50 of the fetchers. In this case, the stop signal 54 is asserted for Fetcher 
A via the condition line 145 and not for Fetcher B. Therefore Fetcher A latches the stop address so that it will 
stop fetching when that address is reached. When the branch point stored in the branch pointer register 120 
matches the address of the next instruction stored in the instruction pointer register 108, the output of the com- 
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pare unit 124 causes an effect branch signal to be asserted on line 146. This controls the instruction pointer 
multiplexor 112 to update the instruction pointer register 108 from the target pointer register 118 which, it will 
be recalled, holds the target location identified in the set branch instruction. Furthermore, the effect branch 
signal 146 is supplied to the active fetcher switch 112 which causes fetcher B to become the active fetcher c 
5 ommencing fetching from the target location so that the branch is taken. The output of the active fetcher switch 
112 controls the instruction multiplexor 104 to switch its inputs. 

If the condition which was evaluated for the confirm instruction indicates that the branch is not to be taken, 
on execution of the do instruction by the do branch execution unit 142, an invalid address is supplied on branch 
Hne-144^oH5e^torecHrrthe-stopHat 

10 to fetch sequential instructions from memory so that the branch will not be taken. Likewise, the invalid address 
stored as the branch point in the branch pointer register 120 will never match the address stored in the instruc- 
tion pointer register 108 so that compare unit will never cause the effect branch signal 146 to be asserted. 

The desription given above relating to the computer system of Figure 6 is for a system including a branch 
point register and a state indicator for holding the state of the system. As mentioned earlier, these may be omit- 

15 ted if a method such as method C is utilised in which the effect branch signal is a conditional signal located 
at the branch point. The set branch instruction sets the target location into the target pointer register 118 as 
described above. When the effect branch instruction is executed, the condition defined in the instruction is 
evaluated. If that condition is true, the effect branch signal 146 is supplied to cause fetcher B to become the 
active fetcher commencing fetching from the target location so that the branch is taken. The target location 

20 remains valid in the target pointer register 118 and, instead of fetcher A being controlled to stop fetching as 
described above, it is caused to commence fetching new instructions from the target location. This has the 
advantage that if a further branch should be required having a target location defined by the set instruction, 
the new instructions have already been fetched. 

When an effect branch instruction has been executed for which the condition defined in the instruction is 

25 true, all instructions which entered the execution unit 142 after the effect branch instruction are deleted be* 
cause they should not be executed. Instead, execution of the new instructions is commenced. 

Thus, on execution of an effect branch instruction for which the condition is satisfied, fetcher A and fetcher 
B swop over their roles. 

If an effect branch instruction is executed for which the condition determines that the branch is not to be 

30 taken, there is no change to the roie of the fetchers. 

The target location remains valid and stored in the target pointer register 118 until execution of the next 
set instruction, which resets a new target location in the target pointer register 118. 

Figure 11 is a block diagram illustrating this principal. In Figure 11, reference numerals 39 and 43 denote 
the address bus and data bus as in preceding figures. The fetch circuitry 10 in Figure 4 is implemented by an 

35 execute instruction fetcher 402 and a target instruction fetcher 404. These are denoted fetcher E and fetcher 
T respectively. Like elements in each fetcher have the same reference numeral, suffixed E or T appropriately. 
Each fetcher comprises a fetch pointer 406 for holding an indication of the next address from which instructions 
are to be fetched and a start pointer 408 for holding the target location representing the first instruction in a 
new string of instructions. Each fetcher also has a buffer 41 0 for holding a queue of instructions. 

40 Reference numeral 118 denotes as before the target pointer register which holds an indication of the target 

location responsive to execution of the set branch instruction. Reference numeral 412 denotes a select circuit 
for selecting the one of the fetchers acting as the execute Fetcher E to supply its instructions to the decode 
circuit 16 of Figure 4. 

A sequencer 414 receives the effect branch signal 146 and has state which changes on receipt of that signal 
45 to cause the role of the fetchers to alter as described above, by controlling the select circuit 412 along line 411 . 

The sequence 414 can also provide signals to the fetchers 408E,T along lines 418E.418T respectively. 
When the effect branch signal occurs, the sequencer supplies a signal on line 41 8E to cause Fetcher E to com- 
mence obtaining instructions from the target location. 

Sometimes it is useful to provide more than two instruction fetchers to take into account conditional in- 
50 structions which probably will be satisfied and conditional instructions which probably will not be satisfied. This 
is particularly the case where the execution unit is pipelined and the determination of the effect branch con- 
dition is made at a late stage in the pipelin . 

Figure 12 is a block diagram illustrating a system for implementing such an arrangement In Figure 12, 
like numerals denote like parts as in Figure 11 for the execute instruction fetcher and the target instruction 
55 fetcher and the other features common between Figures 11 and 12. 

The system of Figure 12 additionally includes a mispredict fetcher 403, fetcher M. This likewise has a fetch 
pointer register 406M for holding the address from which next instructions are to be fetched, a buffer 41 0M 
for holding a queue of instructions and a start pointer register 408M for holding the start address from which 
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instructions are to be fetched. The select circu it 412 is arranged to select the one of the three fetchers acting 
as the execute fetcher E to supply instructions to the decode circuitry 16. 

As before, the execute fetcher 402 fetches the normal sequence of instructions for supply to the execution 
unit The target fetcher 404 fetches instructions beginning from the target location, and thus holds the next 
5 branch target instruction. 

In addition to these two fetchers, the mispredict fetcher 403 fetches instructions which are located at a 
branch resulting from a conditional instruction which probably will not be satisfied. This is referred to as a pre- 
dicted to be not used arm of a branch. If it is subsequently determined that a conditional branch was predicted 

10 On execution of a set branch instruction, an indication of the target location is loaded into the target pointer 

register 11 8 as before. The target fetcher 404 is cleared and starts fetching instructions from the target location. 

When a conditional DO instruction which will probably be taken (or an unconditional DO instruction) is sent 
to the execution unit, the functions of the three fetchers is changed so that the execute fetcher 402 becomes 
the mispredict fetcher, the target fetcher 404 becomes the execute fetcher and the mispredict fetcher 403 be- 
ts comes the target fetcher. The new target fetcher (formally the mispredict fetcher fetcher M) is cleared and starts 
fetching instructions from the target location stored in the target pointer register 118. When a conditional DO 
instruction representing a condition which probably will not be met is sent to the execution unit, the roles of 
the target and mispredict fetchers alter as follows. The target fetcher fetcher T becomes the mispredict fetcher 
and the mispredict fetcher fetcher M becomes the target fetcher. The new target fetcher is cleared and starts 

20 fetching instructions at the target location stored in the target pointer register 118. 

Detection of despatch of these conditional DO instruction to the execution unit is undertaken by a detect 
unit 41 3. This supplies Predict Taken and Predict Untaken signals to the sequencer 414. 

When a DO instruction is executed by the execution unit, it is determined whether or not the branch is to 
be taken. If this agrees with the prediction, then all is well. If however the taking of the branch was incorrectly 

25 predicted, all instructions which entered the execute unit after the incorrectly predicted conditional DO instruc- 
tion are deleted, and the mispredict fetcher and execute fetcher swap roles. To this end the sequencer receives 
a Mispredicted signal from the execution unit. 

This arrangement can be implemented in a variety of different ways without affecting the concept. For ex- 
ample, when the fetchers are renamed the contents of the outgoing target fetcher could be copied to the in- 

30 coming target fetcher. This means that the new target fetcher does not waste time refetching these instructions. 
It would continue the fetching from where the outgoing target fetcher left off, provided it is not full. Alternatively, 
the fetchers need not be renamed dynamically, but their contents transferred as appropriate. This would re- 
move any requirement for state in the sequencer 414. 

The instruction fetcher and circuitry described above are capable of implementing so-called simple 

35 branches. Other, specialised type of branch instructions are also useful. One of these is kernel branches. Many 
processors have two modes of operation, one for normal programs and one for special programs. These are 
referred to as user and kernel modes. Kernal mode has more instructions available to it which are used to ma- 
nipulate the operation of the computer. This separation is required to prevent an erroneous or malicious user 
mode program from causing damage to other user mode programs. Kernel mode programs can be assumed 

40 to be correct. There is therefore a need for a method to change a programs mode from user to kernel. This is 
done by branching to a special target location, called the kernel entry point With the present invention this is 
implemented using a special set branch instruction, which does not specif y the branch target location explicitly 
but uses an implicit value. Some state must be used to specify that when the branch occurs, the processor 
must change to kernel mode. 

45 Another specialised kind of branch instruction is a so-called descriptor branch, which is a call via a pointer. 

This branch instruction specifies an address in memory, but it is not the address representing the target lo- 
cation of the branch. Instead, it is a memory location containing the target location for the branch. 

Figure 7 illustrates an instruction fetcher which can be used to implement kernel and descriptor branches. 
Like numerals in Figure 7 denote like parts in Figure 5. The fetcher of Figure 7 has the following additional 

so circuitry. A kernel latch 200 holds the address to use for kernel calling and can only be programmed by trusted 
code. It receives at its latch input a store kernel signal 202 to latch the kernel address on line 204. A kernel 
multiplexor 206 receives the kernel address atone input thereof and the normal start address on line 40 at the 
other input thereof. The kernel multiplexor 206 is controlled by a branch kernel signal on line 208. The branch 
kernel signal is asserted at the same time as the start signal 40 would normally be asserted to initialise a 

55 branch. When the signal is asserted, the address held in the kernel latch 200 is stored into the fetch latch 65 
via multiplexor 206 and a further multiplexor 208, rather than the address supplied by the start signal. 

The fetcher also includes a descriptor latch 210 which indicates whether the fetch pointer 65 holds an in- 
struction address or a descriptor address. It is controlled by the descriptor signal on line 212. When the de- 
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scriptor latch 210 indicates that the address is a descriptor address, it is loaded from data held in the buffer 
66 via a pointer register 214. The multiplexor 208 controls whether the address supplied to the fetch pointer 
65 isfrom the pointer register 214orfrom the multiplexor 206. A descriptor indication unit 216 signals whether 
or not the buffer 66 holds enough information for the pointer register 214 to hold the correct descriptor address. 

5 The fetcher of Figure 7 is thus able to carry out kernel branches and descriptor branches. 

Another type of branch instruction is procedure calling. This requires that a suitable instruction pointer val- 
ue is saved so that the procedure can return back to the piece of program from whence it came. Thus, the 
procedure can be called from different parts of the program. Figure 8 illustrates the flow graph for a procedure 

ralt^gure-8^1ustrates-a-p^ 

10 instructions Block A, Block B between which is located a call instruction. Similarly, Part 2 has sequential blocks 
of instructions Block C, Block D between which is a call instruction. The procedure includes a sequence of 
procedure instructions PROC and a return instruction. The two pieces of code, Part 1 and Part 2 both call the 
Procedure and both return to their respective control flows. The call instruction can be implemented as a par- 
ticular type of set or do branch instruction which not only identifies a target location (SET) or branch point (DO) 

15 but causes the return address of the first instruction of the next sequential block to be saved in a return register. 
Then, the return instruction can be implemented as a particular type of set instruction which effects a branch 
to the return address which was held in the register. 

Figure 9 indicates the state register required to implement procedure calls. This includes registers 230 with 
a select register unit 232 controlled by a register select signal 234. On execution of a set or do branch (or call) 

20 instruction, the address of the next instruction after the call instruction to which the program is to return is 
stored in the registers 230 on branch line 236 responsive to the store signal 238. When the special set (or re- 
turn) instruction is implemented, the branch is effected to the target location which is stored in the specified 
register 230. 

In the above described embodiment, there are two instruction fetchers which can both function as the ac- 
25 tive fetcher depending on the state of the switch multiplexor. Figure 10 illustrate in block diagram form an al- 
ternative embodiment where the instruction fetch circuit comprises two instruction fetchers, one of which is 
always the active fetcher. This embodiment will now more clearly be described with reference to Figure 10. 
Like numerals in Figure 10 denote like parts to Figure 6, but primed. Thus, Figure 10 illustrates a pipelined 
processor 17' including execution circuitry with a set branch instruction execution circuit 1 36* and a do branch 
30 instruction execution circuit 1 42\ There is an instruction pointer register 1 08* and a target pointer register 1 1 8\ 
The fetch circuit includes an active fetcher and a target fetcher. The active fetcher includes a fetch pointer 
65* and an instruction buffer 66\ The target fetcher similarly includes a fetch pointer 65" and a target instruc- 
tion buffer 66". 

On execution of a set branch instruction, the target pointer register 118' is initialised to instruct the fetch 
35 pointer 65" of the target fetcher to commence fetching instructions from the target location. Meanwhile, th 
active fetcher is fetching instructions sequentially from memory and supplying them to the processor 17*. On 
execution of the effect branch instruction, a copy unit 300 acts to copy the contents of the target instruction 
buffer 66" of the target fetcher to the instruction buffer 66' of the active fetcher so that the next instructions 
to be supplied to the processor 17* are those commencing from the target location. 
40 Further details of implementation of the circuit of Figure 1 0 are not given herein because it will be apparent 

from the information given in relation to the circuit of Figures 5, 6 and 7 how the circuit of Figure 10 could b 
implemented. 

45 Claims 

1. A computer system for fetching, decoding and executing instructions comprising: 

storage circuitry for holding a plurality of instructions at respective storage locations, said plurality 
of instructions being arranged in instruction strings, each string comprising a first instruction and a set 
so of subsequent instructions; 

instruction fetch circuitry for fetching a sequence of instructions from said storage circuitry and 
including an indicator for providing an indication of a next address at which a next fetch operation is to be 
effected; 

execution circuitry for executing fetched instructions, wherein at least some of said instruction 
55 strings each includes a set branch instruction (SET) which provides an indication of a target location from 

which a subsequent instruction may be fetched, the subsequent instruction being from a different instruc- 
tion string, and wherein said instruction fetch circuitry is operated responsive to execution of a said set 
branch instruction (SET) to fetch in parallel subsequent instructions from said string containing said set 
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branch instruction and new instructions from said different instruction string commencing from said target 
location while said subsequent instructions continue to be executed; 

a target store for holding the indication of said target location, said indication being loaded into said 
store on execution of said set branch instruction (SET) and being held in said store as a valid indication 
until execution of a subsequent set branch instruction; and 

select circuitry responsive to generation of an effect branch (DO) signal indicative that further in- 
structions to be executed are said new instructions, to cause said execution circuitry to execute said new 
instructions and to cause said instruction fetch circuitry to fetch again new instructions commencing from 
^aicttargetloratiorn 
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A computer system according to claim 1 wherein said instruction fetch circuitry comprises two instruction 
buffers, a first buffer for holding subsequent instructions connected to said execution circuitry, and a sec- 
ond buffer for holding new instructions wherein the contents of said second buffer are copied into said 
first buffer responsive to generation of said effect branch (DO) signal. 

A computer system according to claim 1 wherein said instruction fetch circuitry includes two instruction 
f etchers for fetching respectively said subsequent instructions and said new instructions and wherein said 
select circuitry is operable to connect a selected one of said instruction fetchers to said execution circuitry. 

4. A computer system according to claim 3 wherein said instruction fetch circuitry comprises a third instruc- 
^ tion fetcher for fetching instructions to implement predicted conditional instructions. 

5. A computer system according to claim 1 , 2, 3 or 4 which includes a first register for holding an indication 
of the address from which a next instruction is to be fetched. 

25 6. A computer system according to any preceding claim wherein the target store holds the address from 
which the first instruction of a string of new instructions is to be fetched. 

7. A computer system according to any of clai ms 1 to 5 wherein the set branch instruction identifies a special 
register which holds the address from which the first instruction of a string of new instructions is to be 

30 fetched. 

8. A computer system according to any of claims 1 to 5 wherein the target store holds the address of a mem- 
ory location which holds the address of the first instruction of a string of new instructions is to be fetched. 
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9. A computer system according to any preceding claim which comprises decode circuitry for decoding said 
fetched instructions, said instruction fetch circuitry, decode circuitry and execution circuitry being ar- 
ranged in a pipeline. 

10. A computer system according to any of claims 1 to 9 wherein said at least one instruction string includes 
a further instruction which determines the branch point after which new instructions are to be executed, 
identification of said branch point causing generation of said effect branch signal to said select circuitry. 

11. A computer system according to claim 10 wherein said further instruction is located at the branch point 
after which said new instructions are to be executed. 



45 12. A computer system according to claim 10 wherein said further instruction is located in the string prior to 
the branch point after which further instructions to be executed are said new instructions, said further 
instruction indicating the branch point and wherein the computer system comprises a branch point register 
for holding said branch point. 

so 13. A computer system according to claim 10, 11 or 12 wherein said further instruction is a different instruction 
from said set branch instruction. 

14. A computer system according to claim 13 wherein said further instruction defines a condition and deter- 
mines that further instructions to be executed are new instructions only if that condition is satisfied. 

55 

15. A computer syst m according to claim 5 wherein said set branch instruction identifies the branch point 
after which further instructions to be ex cuted are new instructions, said computer system comprising a 
branch point r gister for storing said branch point and compar circuitry for generating said ffect branch 
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signal to the select circuitry when said branch point matches the address indicated by the first register. 

16. A computer system according to any of claims 10 to 15 which comprises a return register for holding a 
return address being the address of the next instruction after said branch point, wherein said further in- 

5 struction is effective to generate said effect branch signal and to save said return address in said return 

register and wherein said branch instruction identifies said return register to indicate the target location. 

17. A method of operating a computer to fetch decode and execute instructions which computer has storage 
cii^iixyJiftldiP^ locations, said pluralit y of instructions be- 

10 ing arranged in instruction strings, each string comprising a first instruction and a set of subsequent in- 

structions the method comprising: 

fetching a sequence of instructions from said storage circuitry and providing an indication of a next 
address at which a next fetch operation is to be effected; 
decoding said instructions; 

15 executing each instruction in turn, wherein at least some of said instruction strings each include 

a set branch instruction (SET) which provides an indication of a target location from which a subsequent 
instruction may be fetched, the subsequent instruction being from a different instruction string; 

on execution of said set branch instruction, holding the indication of said target location in a target 
store as a valid indication until execution of a subsequent set branch instruction, fetching in parallel sub- 
20 sequent instructions from the string containing said branch instruction and new instructions from said 

different instruction string commencing from said target location; 

continuing to execute said subsequent instructions until an effect branch signal is generated which 
indicates that further instructions to be executed are said new instructions; and 

responding to said effect branch signal by commencing execution of said new instructions and 
25 fetching again new instructions commencing from said target location. 

18. A method according to claim 17 wherein said subsequent instructions are held in a first buffer and said 
new instructions are held in a second buffer and wherein said effect branch signal causes the contents 
of said second buffer to be copied into said first buffer. 

30 

19. A method according to claim 17 wherein a first instruction fetcher fetches said subsequent instructions 
and a second instruction fetcher fetches said new instructions, said effect branch signal selecting which 
of said first and second instruction fetchers supplies instructions for execution. 

20. Amethod according to claim 17, 18 or 19 wherein the effect branch signal is generated responsive to exe- 
35 cution of a further instruction which determines the branch point after which new instructions are to be 

executed. 

21. A method according to claim 20 wherein said further instruction defines a condition and determines that 
further instructions to be executed are new instructions only if that condition is satisfied. 

40 

22. A method according to any of claims 17 to 21 wherein said branch instruction identifies the branch point 
after which further instructions to be executed are new instructions and wherein said effect branch signal 
is generated when said branch point matches the address from which a next instruction is to be fetched. 

45 23. A method according to any of claims 17 to 22 wherein the set branch instruction identifies as the target 
location the address from which the first instruction of a string of new instructions is to be fetched. 

24. Amethod according to any of claims 1 7 to 22 wherein the set branch instruction identifies a special register 
which holds the target location. 

50 25. A method according to any of claims 17 to 22 wherein the set branch instruction identifies the address 
of a memory location holding the target location. 

26. Amethod according to claim 20 or21 wherein execution of said further instruction causes a return address 
to be saved in areturn register and wherein said branch instruction identifies the return registerto indicate 

55 the target location. 

27. A computer system for fetching, decoding and executing instructions comprising: 

storage circuitry for holding a plurality of instructions at respective storage locations, said plurality 
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of instructions being arranged in instruction strings, each string comprising a first instruction and a set 
of subsequent instructions; 

instruction fetch circuitry for fetching a sequence of instructions from said storage circuitry and 
including an indicator for providing an indication of a next address at which a next fetch operation is to be 
effected; 

execution circuitry for executing fetched instructions, wherein at least one of said instruction 
strings includes a set branch instruction (SET) which provides an indication of a target location from which 
a subsequent instruction may be fetched, the subsequent instruction being from a different instruction 
strirrg7m[d~an^fe^tt>rarrc^ 

point after which said new instructions are to be executed, and wherein said instruction fetch circuitry is 
operated responsive to execution of a said set branch instruction (SET) to fetch in parallel subsequent 
instructions from said string containing said set branch instruction and new instructions from said differ- 
ent instruction string commencing from said target location while said subsequent instructions continue 
to be executed; and 

select circuitry responsive to execution of an effect branch (DO) instruction to cause said execution 
circuitry to execute said new instructions if a condition determined by the effect branch instruction is sat- 
isfied. 

28. Acomputer system according to claim 27 wherein said instruction fetch circuitry comprises two instruction 
buffers, a first buffer for holding subsequent instructions connected to said execution circuitry, and a sec- 
ond buffer for holding new instructions wherein the contents of said second buffer are copied into said 
first buffer responsive to generation of said effect branch (DO) signal. 

29. A computer system according to claim 27 wherein said instruction fetch circuitry includes two instruction 
f etchers for fetching respectively said subsequent instructions and said new instructions and wherein said 
select circuitry is operable to connect a selected one of said instruction fetchers to said execution circuitry. 

30. A computer system according to claim 27, 28 or 29 which includes a first register for holding an indication 
of the address from which a next instruction is to be fetched. 

31. A computer system according to claim 27, 28, 29 or 30 which comprises a target register for holding an 
indication of the target location identified by the set branch instruction. 

32. A computer system according to claim 31 wherein the target register holds the address from which the 
first instruction of a string of new instructions is to be fetched. 

33. A computer system according to claim 31 wherein the set branch instruction identifies a special register 
which holds the address from which the first instruction of a string of new instructions is to be fetched. 

34. A computer system according to claim 31 wherein the target register holds the address of a memory lo- 
cation which holds the address of the first instruction of a string of new instructions is to be fetched. 

35. A computer system according to any of claims 27 to 34 which comprises decode circuitry for decoding 
said fetched instructions, said instruction fetch circuitry, decode circuitry and execution circuitry being 
arranged in a pipeline. 

36. A computer system according to any of claims 27 to 30 which comprises a return register for holding a 
return address being the address of the next instruction after said branch point, wherein said further in- 
struction is effective to generate said effect branch signal and to save said return address in said return 
register and wherein said branch instruction identifies said return register to indicate the target location. 

37. A method of operating a computer to fetch decode and execute instructions which computer has storage 
circuitry holding a plurality of instructions at respective storage locations, said plurality of instructions be- 
ing arranged in instruction strings, each string comprising a first instruction and a set of subsequent in- 
structions the method comprising: 

fetching a sequenc of instructions from said storage circuitry and providing an indication of a next 
address at which a next fetch operation is to be effected; 
decoding said instructions; 

executing each instruction in turn, wherein at least one of said instruction strings includes a set 
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branch instruction (SET) which provides an indication of a target location from which a subsequent in- 
struction may be fetched, the subsequent instruction being from a different instruction string; 

on execution of said set branch instruction, fetching in parallel subsequent instructions from the 
string containing said branch instruction and new instructions from said different instruction string com- 
mencing from said target location; 

continuing to execute said subsequent instructions until an effect branch instruction is executed 
which is located at the branch point after which new instructions are to be executed and which indicates 
that further instructions to be executed are said new instructions if a condition determined by the effect 
-branch-instruction-is-satisfiedrand — 

responding to said effect branch signal by commencing execution of said new instructions. 
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